support lepton launching + add train stream script for esm2#1663
support lepton launching + add train stream script for esm2#1663jwilber wants to merge 2 commits into
Conversation
Signed-off-by: jwilber <jwilber@nvidia.com>
|
Important Review skippedAuto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
|
/ok to test 71e04fb |
polinabinder1
left a comment
There was a problem hiding this comment.
LGTM once checks pass
|
Is this the same pathway we do for other recipes integrations to Lepton? or this is a brand new pathway? |
|
@trvachov Same pathway. the extra code you see here is because we don't want to save activations for CI, because those datasets are so large. so instead, we extract + train as we go, never materializing them to disk |
Adding lepton support for sae recipes.
Starting with ESM2 SAE, which I added producer/consumer support (
train_streaminghere) so no activation dataset needs to be extracted/saved during training.